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Abstract. Image registration is a process to find the offset or misalignment be- 
tween two or more images for a certain area to determine the required geome- 
trical transformation that aligns points in one image with its corresponding in 
the other one. Generally, the operational goal of the registration process is a 
geometrical transformation for the input leading to geometrically agreement for 
input images, so that the matched pixels in the input images refer to the same 
region of the captured area. So, image registration can be applied in many ap- 
plications such as change detection, mosaicking, creating super-resolution im- 
ages etc. Registration process is divided into two categories: 1) Traditional me- 
thods and 2) Automated methods. For the traditional methods, the anchor, con- 
trol, points are selected manually and applying the transformation model lead- 
ing to time consuming and low accuracy. So, automatically detection of these 
points helps to recover the performance of manual selection. Registration 
process deals with many problems such as illumination changes, intensity varia- 
tions, Different sensors, noise etc. So, its applications are mainly dependent on 
errors (multi temporal, multi view, or multi modal) occurred during capturing 
process. Feature detection, as a step of image registration process, aims to find 
a set of stable (invariant) distinctive key points or regions under varying condi- 
tions. Also, it is critical for the detector to be robust to changes in viewpoint, 
brightness, and other distortions. The goal of current paper is discussion and 
exhibition of the common corner detectors helping to be familiar with the vari- 
ous applied techniques for feature detection 
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1 Introduction 


The process of image features detection and extraction is very important for 
representing the image. There are two main approaches for automatic registration. 
Firstly, Area Based Matching methods (ABM), may be called as coarse co- 
registration, is pixel-to-pixel registration process. ABM methods may be Correlation 
methods, Fourier methods, Mutual Information (MI) methods, and Optimization me- 
thods. ABM methods emphasize the existence of features rather than their detection, 
i.e. no features are detected. Secondly, Feature Based Matching methods (FBM) or 


fine co-registration as may be called, is subpixel-to-subpixel process. Each one of 
these two approaches has its own advantages and drawbacks [1- 5]. Features may be 
points (corners, interest), edges, blobs, texture, or color [6- 9]. In general, there are 
two types of features that can be detected, global and local features. Generally, the 
content of image is depicted totally using global features, whereas detecting the key 
points in the image requires the usage of local ones [6-10]. 

Global features have the drawback of image clutter and occlusions which limit the 
usefulness of the description. To handle the limitations of global features, its desired 
to do one of the following: 1) image segmentation in a limited number of regions for 
example the blob world system or 2) image sampling for different image subparts and 
this bordered the extent of potential applications [8, 10]. The expectation of local 
features being more useful than global ones is based on their specifications which are 
mainly dependent on ascendant performance, structure stability, and informativeness 
property [6, 7, 10]. In order to get correspondences among images, it is required to 
distinguish a set of noteworthy points in each one [8, 11]. Various techniques of fea- 
ture detection have been proposed and were designed for a certain application such as 
shape matching. The best execution is mainly dependent on the parameters of the 
detector [8-11]. The detectors must have specific characteristics depending on the 
application as robustness, repeatability, accuracy, generality, efficiency, and quantity. 
Good feature should be repeatable, informativeness, localized [2, 8-10]. It must be 
noticed that distinctiveness and locality are competing properties and cannot be ful- 
filled simultaneously [10, 11]. 

The corner can be illustrated as the intersection point of two or more different di- 
rections of rims [8, 10, 11]. Accuracy of features detection, specially corners, is diffi- 
cult under imperfections of data such as radiometric (illumination changes and/or 
different sensor characteristics) and geometric (scaling, translation, rotation, affine, 
projective, or nonlinear differences) differences. So, it is desired to detect the same 
features in the images in spite of these imperfections. Scaling, translation, and rotation 
problems can be easily recovered since there is no change in the shape of information. 
But; with respect to affine, projective, and nonlinear problems, the process is more 
difficult to be handled since the shape of information changed [8,9]. Therefore, fea- 
tures detection methodology should be insensitive to common geometric and radi- 
ometric changes. Commonly used techniques to acquire corners are contour curva- 
ture-based methods, intensities based on derivatives methods, exploitation of color 
information, human visual system, model-based methods, segmentation-based me- 
thods, and machine learning techniques [1-3, 8, 11]. Examination of the performance 
of a corner detector, basically; started with artificially created images with various 
types of junctions associated with different lengths, contrasts, angles etc. This ap- 
proach can make simplification for the evaluation process but cannot model all imper- 
fections which affect the performance of a detector in a real case, so; the results of 
performance are often over-optimistic [1, 8-11]. Generally; the evaluations of both 
performance and quality of different key point detection methods were discussed 
where merging several key point detection techniques is useful if it is handled well 
[1,8,11]. 


2 COMMON CORNER DETECTORS 


Feature detection methods can be classified into three groups: a) template-based cor- 
ner detection, b) contour-based corner detection, and c) direct corner detection [8-11]. 
Also, it can be categorized according to the operating scale into three categories: 
a) single scale, b) multi-scale, and c) affine invariant [1, 8-10]. As mentioned before, 
the aim of this paper is helping readers who is looking forward to be familiar with the 
different applied techniques used for feature detection. 


2.1 Moravec Corner Detection 


Moravec’s detector [12] is a single-scale Detector. It is one of the earliest approaches 
that specifies a key point to be that of low essence similarity. Each image’s pixel is 
investigated by correlating overlapping patches surrounding each neighboring pixel 
such that, the strength of correlation in any direction disclosures information about 
this point. Strength of corner is acquainted as the smallest sum of squared differences 
(SSD) between the patch and its neighbors for the three main directions (horizontal, 
vertical, and diagonals), where the moment at which SSD reaches a local maximum; a 
corner (interest point) is detected depending on the non-maximum suppression opera- 
tion [8-11]. Existence of noise can be recovered by smoothing operations. Both exis- 
tence of noise and increasing the size of window reduce the number of detected fea- 
ture points and subsequently number of symmetric points. When calculating differ- 
ences in eight directions, the output befits invariant to rotations that are a multiple of 
45 degree [9]. Advantages of moravec’s detector are as following: a) detecting the 
lump of the corners, and b) requires less computation than finding the central moment 
at a window of the same size. While, the disadvantages of moravec’s detector are: 
a) it is not isotopic, and b) its computing is shrill [8-10]. 


2.2 Harris Corner Detector 


Harris corner detector [13] is a single scale derivatives-based detector that had been 
advanced to handle the constraints of moravec’s detector by getting the variation of 
the auto-correlation (i.e. intensity variation) over all different orientations. It can be 
considered as a combination of both edge and corner detectors, mainly for gray scale 
images [8-10, 11, 13]. The gist idea is calculation of the eigenvalues for a small 
region. Then, using the largest two eigenvalues to estimate a specific function which 
is used together with a threshold to detect the corners which have an ill-defined 
gradient as shown in figure | [8, 9, 13]. The target of harris method is to find the 
direction of fastest and lowest change for feature orientation by using local directional 
derivatives covariance matrix [11, 13]. Also, an provides improvements over the Mo- 
ravec method was done by developing a detector based on harris to be utilized for the 
color information [2, 6, 7]. 

Harris corner detector has many advantages as following: 1) it is invariant to trans- 
lation, rotation and illumination change, 2) it is the most repetitive, most informative 
or discriminative, and precise localization, 3) it is the most stable one in many inde- 
pendent evaluations, 4) less computations requirements, 5) it provides high rotational 


invariance repeatability. The Disadvantages of harris corner detectors are: 1) it might 
be sensitive to noise, 2) its computations are more expensive, and 3) it is it is not inva- 
riant to large scale change (i.e. only repeatable up to relatively small-scale changes) 
[8-11]. Shi, Tomasi [14] was developed as an optimization based on harris method. 
Shi, Tomasi allow the usage of the minimum eigenvalues for differentiation and con- 
siderably thus regulates and facilitates the computation of harris [8, 11, 14]. 





Fig. 1. Illustration of gist idea of harris detector 


Since developing a detector robust to perspective transformations requires inva- 
riance to affine transformations, so; Harris Laplace / Harris Affine were developed, 
based on harris method, to act as a multi-scale affine invariant corner detector. The 
main idea, a combination of harris method and a gaussian scale space representation, 
allows detection of features over a range of scales. Their advantages are: 1) providing 
a representative set of points in the scale dimension, 2) invariant to scale changes, 
rotation, illumination, and addition of noise, and 3) interest points are highly repeata- 
ble. Their disadvantages are: 1) reduction the number of redundant interest points 
compared to multi-scale approaches as Laplacian of Gaussian (LoG) or Difference of 
Gaussian (DoG) detectors, and 2) failing in the case of affine transformations [8-11]. 

Also, a development, based on harris method, depending on local invariants was 
achieved allowing distinguish neighborhoods of interest points. The reason behind 
this trend is that edges are considered stable image features that can be detected over a 
multiple viewpoints, multiple scales and illumination changes. Moreover, by exploit- 
ing the edge geometry, the dimensionality of the problem can be significantly reduced 
[8-10]. 


2.3. SUSAN Corner Detectors 


Susan corner detector [15] is a single scale detector which does not smooth the image 
or use locative derivatives. It uses a morphological approach as a different technique 
rather than computing the local gradients which might be noise sensitive and compu- 
tationally more expensive. Smallest Univalue Segment Assimilating Nucleus 
(SUSAN) has been used as a corner detection and edge detection besides noise sup- 
pression where a circular mask is applied around every pixel, and checking the gray 


values for all pixels within the mask then compared to that of the centre pixel (nuc- 
leus) [8, 10, 11]. Corners can be distinguished as the locations where the number of 
pixels with similar intensity records in a local neighborhood catches a local minimum 
and below a specified threshold as shown in figure 2 where A is an edge, B is a cor- 
ner, and C is neither an edge nor a corner. Increasing robustness can be achieved us- 
ing weighting coefficient where Univalue Segment Assimilating Nucleus (USAN) 
represents the pixels having almost the same value as the nucleus are grouped togeth- 
er [8-11]. It is important to notice that it can work well when the noise is present since 
it does not require derivative [11, 15]. 

The advantages of SUSAN corner detector are as following: 1) its computations 
are fast since no derivatives are used, 2) it can be used as corner and edge detector in 
addition as image noise reduction, 3) it has good repeatability rate, 4) it is more effi- 
cient. 5) It is invariant to rotation, translation and illumination change. The disadvan- 
tages are: 1) it is not invariant to scaling and affine, 2) specifying a fixed threshold is 
not suitable for general applications; 3) it has low noise sensitivity, low robustness, 
and low localization, 4) it is less discriminative [2, 8-11]. 


2.4  Förstner Corner Detector 


Forstner corner detector [16] is a single scale detector, developed in a manner to com- 
pute the location of a corner up to sub-pixel accuracy. The main goal is finding the 
best solution for the point closest to all tangential lines of the corner in a given win- 
dow as shown in figure 3. For the ideal corner, all the tangential lines cut across at a 
unique point and this is its main gist. Férstner corner detector had been automatically 
adapted by Lindeberg [17] to have the ability to be multi-scale levels for computing 
the image gradients in the presence of the noise [8, 10]. 





Fig. 2. Computations of SUSAN method Fig. 3. Illustration of Férstner corner detector 


2.5 Robust Fuzzy Rule Corner Detector 


No doubt that corner detection for general purpose with high performance and effi- 
ciency is a challenge. Starting from this point, a few fuzzy approaches have specifi- 
cally discussed this problem [8, 10, 11]. An algorithm has been proposed by Banerjee 
& Kundu [18] to extract significant corner points in gray level where the measure- 
ment of cornerness for each point is computed by means of the vague edge strength. 
Changing the threshold value causing different corner sets, which are fuzzy as shown 
in figure 4 [8, 10]. The main advantage of Fuzzy Rule corner detector is that it can be 


easily extended to detect different types of features, while using different types of 
detectors which operate at different stages, causing a high computational load, can be 
considered its main drawback [8, 10]. 


2.6 Hessian Corner Detector 


Hessian detector [19] is a single scale blob detector, also it referred to as Determinant 
of Hessian (DoH) method which investigates positions in image that offer strong 
derivatives in two orthogonal directions. Its main idea stands behind searching for a 
point, center of searching window, whose determinant of hessian, based on the second 
derivative matrix, is maximum as shown in figure 5. Then, non-maximum suppres- 
sion is applied retaining only pixels whose value is greater than that of all eight im- 
mediate neighbors inside the window. Finally, all lasting points, whose value is above 
a specific predefined threshold, are returned by the detector [2, 4, 7, 11]. Determinant 
of Hessian (DoH) method is used in the popular SURF algorithm [20]. 

The advantages of hessian detector are: 1) regions with strong texture variation 
could be returned with many responses, 2) it can provide further interest points that 
result in a densely objects, 3) it has good performance and repeatability. The draw- 
backs of hessian detector are: 1) its returns are only repeatable up to relatively small- 
scale changes, 2) it is sensitive to noise leading to low localization accuracy causing 
the local maxima to be less stable [2,4, 8-11]. 

Hessian detector was developed by Mikolajczyk and Schmid [21] to operate as a 
scale invariant blob detector called Hessian-Laplace which is a combination of hes- 
sian detector and a gaussian scale space representation. The core is that it operates on 
local extrema, using DoH at multiple scales for spatial localization, and the Laplacian 
at multiple scales for scale localization. Also, it was developed to be affine invariant 
utilizing harris corner detector by integrating key points from several scales in a py- 
ramid with some iterative selection gauge, and a Hessian matrix [8, 9]. Hessian- 
Laplace concluded that the response of hessian detector exists if significant variations 
over any two perpendicular ways occurred. Also, for a single image, the repeatability, 
discernment more reliable regions, of hessian affine approach better than that of harris 
affine one. In addition, the hessian affine detector responds well to regions which 
have a lot of corner like parts (for example, hessian affine detector performs very well 
for buildings scenes. The advantages of hessian affine are: 1) extraction large number 
of features, 2) using spatial and scale localization is more suitable for scale estima- 
tion, and 3) Hessian detectors are more stable than the Harris-based counterparts [8- 
11]. Although these advantages, hessian affine has a drawback that its repeatability is 
low compared to Harris-Laplace [8, 10, 20]. 


2.7 FAST Corner Detector 


Rosten, E., Drummond [22] proposed Features from Accelerated Segment Test 
(FAST) which is a uni-scale corner detector. Interest points are identified by applying 
a part test to the intensity of every pixel in the image. This is achieved by considering 
a circle (bresenham circle) of sixteen pixels, as a base of computation, around the 
interest point [8, 10, 11]. The key point can be detected if its intensity value (I) is 


greater or smaller than the value of a collection of (n) neighboring pixels in bresen- 
ham circle as shown in figure 6, taking into account a predefined threshold [8]. 

Rosten, E., Drummond developed a high-speed test to eliminate a very large num- 
ber of non-corner points by investigating only the pixels numbers 1, 5, 9 and 13. If 
three of these four test pixels are brighter or darker than the intensity value (I), then a 
corner can only exist as a first step. For final result, the remaining pixels are ex- 
amined. FAST detector is similar to SUSAN, but uses a smaller window size in addi- 
tion to the fact that only some of pixels are examined instead of all of them. It can be 
considered a relative of the local binary pattern LBP [2, 7, 10, 11]. 

Fast detector has many advantages as it has high performance, high speed, efficient 
to compute and fast to match, a good accuracy, high repeatability, its ability to aver- 
age out noise is reduced and it is faster than other corner detectors. But it suffers from 
being not robust to high levels noise, depending on a threshold, probability to respond 
to one-pixel wide lines at certain angles, and it is not a scale-space detector [8, 10, 
22). 
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Fig. 4. Different Fuzzy Corner set Fig. 5. Both Hessian matrix and determinant 





Fig. 6. Illustration of operation of FAST corner detector 


2.8 Laplacian / LoG / DoG Detectors 


Laplacian processing, is a method of computing the maximum rate of change (deriva- 
tive) in a pixel. The Laplacian operator is described for a gray-scale image [8, 10, 11]. 
Laplacian of Gaussian (LoG) [23] is a multi-scale detector which is simply combines 
Laplacian operation over an area that had been processed using a Gaussian smoothing 
kernel to converge the edge [8, 9]. This means that LoG is a linear combination of 


second derivatives and a blob detector. The operator’s output is strongly dependent on 
the relationship between blob structures size and smoothing Gaussian kernel size 
where the standard deviation of the Gaussian is used to control the scale by changing 
the amount of blurring [8, 11]. LoG at a pixel in the image becomes locally maximum 
or minimum when the pixel represents the center of a dark or bright spot respectively 
[8-10]. Lindeberg [17] developed an automatically method, considered to be a local 
curvature detection, for capturing blobs of different size where a multi-scale approach 
with automatic scale selection is proposed where the gradient magnitude and the local 
surface curvature are both high. It is important to mention that the local curvature 
methods are the first known to be accurate and reliable in focusing corners over scale 
changes [8-11]. Advantage of LoG are: 1) it is invariant to rotation, 2) it provides 
somewhat a good rating for other features such as corners, ridges, etc., 3) the detected 
features can represent extended and splitting structures. Disadvantage of LoG are: 1) 
its computation is time consuming, 2) less stable, 3) more sensitive to noise [8, 9]. 

Difference of Gaussian (DoG) [24] is a multi-scale detector which is used to speed 
up the computation of LoG, i.e. DoG gives a close approximation to LoG. This can 
be accomplished by subtracting adjacent scale levels of a Gaussian pyramid separated 
by a certain factor without needing for convolution. DoG is used to efficiently detec- 
tion of stable features from scale-space, while it suffers from the same drawbacks of 
LoG [2, 6-9, 11]. 


2.9 Salient Regions (Entropy) Detection 


Kadir and Brady [25, 26] proposed a new algorithm which is based on thought that 
key points over a multi-scale should present a local characteristics (entropy) that are 
unexpected compared to the neighborhood region. The entropy method is mainly 
based on locally maximizing the information content, therefore, the most distinctive- 
ness points can be identified [8,9, 11]. The most important note for the entropy me- 
thod is being a rotationally invariant feature since location of pixel are not used in the 
calculation, therefore the entropy window will not alternate by rearrangement its pix- 
els with respect to the centered window [11, 26]. 


2.10 Gabor-Wavelet Corner Detector 


Gabor-Wavelet detector [27] is a multi-scale interest points detector which is based 
on the base of Gabor wavelets which are stimulated convolution kernels in the pattern 
of plane waves bounded by a Gaussian envelope function [8, 9, 11]. Gabor-Wavelet 
has many advantages as: 1) providing simultaneous optimal resolution in both spatial 
and space frequency domains, 2) capability of enhancing low level features such as 
peaks, ridges, etc., 3) capability of extraction of points from different scales, and 4) 
the detected points have high accuracy and adaptability to various geometric trans- 
formations [8, 9, 27]. 


2.11 Morphological Interest Regions Detection 


Morphological operations [28-29] can be rotation and scale invariant. The main usage 
of this method is creating an interest region on gray scale, binary, or color images, 


although preparing both gray scale and color images for morphology operations re- 
quires some form of pre-processing operations are used [2, 7-10, 26]. The binary thre- 
sholding necessitates a part of work to regulate the parameters correctly for a given 
application to get the highest performance. Generally, the morphological operations 
alone are not sufficient [2, 6-9, 26]. Since Maximally Stable Extremal Regions 
(MSER) [30], feature descriptor method, is based on the principle of connecting 
groups of pixels at maxima or minima locations, so the morphological interest regions 
method is similar to MSER. But it is important to know that MSER does not use mor- 
phology operators [8-11]. 


3 CONCLUSION 


This paper offers a survey of different key points (corner) detection methods for re- 
mote sensing images. The aim of this paper is helping readers who is looking forward 
to be familiar with the different applied techniques used for feature detection. Gener- 
ally, the process (method) of features selection determine the performance. Also, 
FBM methods is more efficient than ABM methods for reducing errors and consum- 
ing time for computations. 
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